Measuring the phylogenetic randomness of biological data sets.

Authors

  • W H Day
  • G F Estabrook
  • F R McMorris
Abstract

Two qualitative taxonomic characters are potentially compatible if the states of each can be ordered into a character state tree in such a way that the two resulting character state trees are compatible. The number of potentially compatible pairs (NPCP) of qualitative characters from a data set may be considered to be a measure of its phylogenetic randomness. The value of NPCP depends on the number of evolutionary units (EUs), the number of characters, the number of states in the characters, the distributions of EUs among these states, and the amount and distribution of missing information and so does not directly indicate degree of phylogenetic randomness. Thus, for an observed data set, we used Monte Carlo methods to estimate the probability that a data set chosen equiprobably from among those identical (with respect to all the other above determining features) to the observed data set would have as high (or low) an NPCP as the observed data set. This probability, the realized significance of the observed NPCP, is attractive as an indication of phylogenetic randomness because it does not require the assumptions made by other such methods: No character state trees are assumed and consequently, only potential compatibility can be determined; no particular method of phylogenetic estimation is assumed; and no phylogenetic trees are constructed. We determined the values and significances of NPCP for analyses of 57 data sets taken from 53 published sources. All data sets from 37 of those sources exhibited realized significances of < 0.01, indicating high levels of phylogenetic nonrandomness. From each of the remaining 16 sources, at least one data set was more phylogenetically random. Inclusion of outgroups changed significance in some cases, but not always in the same direction. Data sets with significantly low NPCP may be consistent with an ancient hybrid origin (or other ancient polyphyletic gene exchange, crossing over, viral transfer, etc.) of the study group.
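The Monte Carlo procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. It assumes that two qualitative characters are potentially compatible exactly when the bipartite graph linking their co-occurring states contains no cycle (a criterion not given in the abstract), and that random data sets are drawn by permuting each character's observed states among the EUs that have a state, which keeps state counts and the positions of missing entries fixed. The names `potentially_compatible`, `npcp`, `shuffle_character`, `realized_significance`, and the `MISSING` marker are hypothetical.

```python
import random
from itertools import combinations

MISSING = None  # assumed marker for missing information


def potentially_compatible(a, b):
    """Assumed criterion: the state co-occurrence graph of a and b is acyclic."""
    # One edge per distinct pair of states observed together in some EU.
    edges = {(x, y) for x, y in zip(a, b) if x is not MISSING and y is not MISSING}
    parent = {}

    def find(v):
        parent.setdefault(v, v)
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for x, y in edges:
        root_a, root_b = find(("a", x)), find(("b", y))
        if root_a == root_b:
            return False  # this edge would close a cycle
        parent[root_a] = root_b
    return True


def npcp(characters):
    """Number of potentially compatible pairs; each character lists one state per EU."""
    return sum(potentially_compatible(a, b) for a, b in combinations(characters, 2))


def shuffle_character(states, rng):
    """Permute observed states among EUs, leaving missing entries in place."""
    observed = [s for s in states if s is not MISSING]
    rng.shuffle(observed)
    it = iter(observed)
    return [MISSING if s is MISSING else next(it) for s in states]


def realized_significance(characters, trials=1000, seed=0):
    """Return observed NPCP and the fractions of random data sets as high / as low."""
    rng = random.Random(seed)
    observed = npcp(characters)
    as_high = as_low = 0
    for _ in range(trials):
        simulated = npcp([shuffle_character(c, rng) for c in characters])
        as_high += simulated >= observed
        as_low += simulated <= observed
    return observed, as_high / trials, as_low / trials
```

A call such as `realized_significance(characters, trials=1000)` returns the observed NPCP together with the estimated probabilities that a random data set has an NPCP at least as high, or at least as low, as the observed one. The number of trials and the simple proportion estimator are illustrative choices.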

Similar resources

Tanacetum zahlbruckneri (Compositae-Anthemideae), an enigmatic record from Iran and its phylogenetic position

Tanacetum zahlbruckneri is reported as a new record from northwestern Iran. A complete description, diagnostic characters and a distribution map of T. zahlbruckneri are presented. The phylogenetic position of T. zahlbruckneri in a concise molecular framework based on ITS nrDNA sequence data is presented. A taxonomic conclusion about the previously reported samples of this enigmatic speci...

Phylogeny of some species of Astragalus (Fabaceae) based on morphological data

The phylogenetic relationships among 39 species belonging to 12 sections of Astragalus from Iran were studied on the basis of 29 morphological characters. The cladistic analysis of the morphological data was performed using the PAUP* 4.0b10 program. The results were compared with molecular systematic data obtained from nuclear DNA ITS. In contrast with previous molecular systematic stud...

Walking on real numbers

Motivated by the desire to visualize large mathematical data sets, especially in number theory, we offer various tools for representing floating point numbers as planar (or three dimensional) walks and for quantitatively measuring their “randomness.”

A Randomness Test for Stable Data

In this paper, we propose a new method for checking randomness of non-Gaussian stable data based on a characterization result. This method is more sensitive with respect to non-random data compared to the well-known non-parametric randomness tests.

Phylogenetic relationships in Ranunculus species (Ranunculaceae) based on nrDNA ITS and cpDNA trnL-F sequences

The genus Ranunculus L., with a worldwide distribution, is the largest member of the Ranunculaceae. Here, nuclear ribosomal internal transcribed spacer (ITS) sequence data and chloroplast trnL-F sequence data were used to analyze phylogenetic relationships among members of the annual and perennial (Group Praemorsa, Group Rhizomatosa, Group Grumosa and Group non-Grumosa) species of Ranunculus...

Journal title:
  • Systematic biology

Volume 47, Issue 4

Publication date: 1998